CLIPS-LSR Experiments at TRECVID 2006

نویسندگان

Stéphane Ayache

Jérôme Gensel

Georges M. Quénot

چکیده

This paper presents the systems used by CLIPSIMAG and LSR-IMAG laboratories for their participation to TRECVID 2006 and the obtained results. Shot boundary detection was performed using a system based on image difference with motion compensation and direct dissolve detection. This system gives control of the silence to noise ratio over a wide range of values and for an equal value of noise and silence (or recall and precision), the F1 value is 0.805 for all types of transitions, 0.833 for cuts and 0.727 for gradual transitions. High level feature detection was performed using networks of SVM classifiers arranged in a variety of architectures and taking into account a variety of low level descriptors combining text, local and global information as well as conceptual context. The inferred average precision of our first run is 0.088. The search system uses a user controlled combination of five mechanisms: keywords, similarity to example images, semantic categories, similarity to already identified positive images, and temporal closeness to already identified positive images. The mean average precision of the system (with the most experienced user) is 0.184. 1 Shot Boundary Detection The CLIPS-IMAG team have participated to the Shot Boundary Detection (SBD) task with little modifications from previous participations. The system detects “cut” transitions by direct image comparison after motion compensation and “dissolve” transitions by comparing the norms of the first and second temporal derivatives of the images. It also contains a module for detecting photographic flashes and filtering them out as erroneous cuts and a module for detecting additional cuts via a motion peak detector. The precision versus recall or noise versus silence tradeoff is controlled by a global parameter that modifies in a coordinated manner the system internal thresholds. The system is organized according to a (software) dataflow approach and Figure 1 shows its architecture. Very little modification was made relatively to the previous versions of the system, only minor adjustments of control parameter. 1.1 Cut detection by Image Comparison after Motion Compensation This system was originally designed to evaluate the interest of using image comparison with motion compensation for video segmentation. It has been complemented afterward with a photographic flash detector and a dissolve detector. 1.1.1 Image Difference with Motion Com-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CLIPS-LSR-NII Experiments at TRECVID 2005

This paper presents the systems used by CLIPSIMAG laboratory. We participated to shot segmentation and high-level extraction tasks. We focus this year on High-Level Features Extraction task, based on key frames classification. We propose an original and promising framework for incorporating contextual information (from image content) into the concept detection process. The proposed method combi...

متن کامل

Learning TRECVID'08 High-Level Features from YouTube

Run No. Run ID Run Description infMAP (%) training on TV08 data 1 IUPR-TV-M SIFT visual words with maximum entropy 6.1 2 IUPR-TV-MF SIFT with maximum entropy, fused with color+texture and motion (NN matching) 5.9 3 IUPR-TV-S SIFT visual words with SVMs 5.3 4 IUPR-TV-SF SIFT with SVMs, fused with color+texture and motion (NN matching) 6.3 training on YouTube data (no use of standard training set...

متن کامل

TRECVID 2004: Video Search Experiments at IUB

The experiments presented in this paper explore topics surrounding video information retrieval (IR). This paper will discuss in detail our participation at TRECVID 2004. A video retrieval system named ViewFinder was developed to search and browse the TRECVID 2004 test data, and both manual and interactive search experiments were carried out. Each of the performed search experiments were in agre...

متن کامل

Anchor Shot Detection in TRECVID-2005 Broadcast News Videos

In this paper, we discuss a new method for detecting anchor shots in broadcast news video. Our approach makes use of face information that includes the position, the size and the number of faces detected in the image frame. To alleviate the adverse effects caused by occasional face detection errors, we propose two new ideas based on multiple queries and zero face padding. Our experiments over T...

متن کامل

REGIMVID at TRECVID 2009: Semantic Access to Multimedia Data

In this paper we describe our TRECVID 2009 video retrieval experiments. The REGIMVID team participated in two tasks: High Level Feature Extraction and Automatic Search. Our TRECVID 2009 experiments focus on increasing the robustness of a small set of sensors and the relevance of the results using a probabilistic weighting of learning examples.

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2007

CLIPS-LSR Experiments at TRECVID 2006

نویسندگان

چکیده

منابع مشابه

CLIPS-LSR-NII Experiments at TRECVID 2005

Learning TRECVID'08 High-Level Features from YouTube

TRECVID 2004: Video Search Experiments at IUB

Anchor Shot Detection in TRECVID-2005 Broadcast News Videos

REGIMVID at TRECVID 2009: Semantic Access to Multimedia Data

عنوان ژورنال:

اشتراک گذاری